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Abstract 

We improve Luczak's upper bounds on the length of the longest cycle in the random 
graph Q(n,M) in the "supercritical phase" where M = n/2 + s and s = o{n) but 
n 2 / 3 = o(s). The new upper bound is (6.958 + o(l))s 2 /ra with probability 1 — o(l) as 
n — > co. Letting c = 1 + 2s /n, the equivalence between G(n,p) and G(n, M) implies the 
same result for G(n,p) where p = c/n, c — ► 1, c — 1 = w(n -1 / 3 ). 



1 Introduction 



The probability space G(n, M) of all n- vertex graphs with M edges under the uniform distri- 



Q\ \ bution is also known as the uniform random graph model. It is one of the earliest models of 
random graphs, originating in a simple model introduced by Erdos [5]. We say that G(n, M) 
has a property asymptotically almost surely (abbreviated a.a.s.) if the probability of this 
^ event is 1 — o(l) as n — > oo. Much of the interest in this model comes from the study of 

its its asymptotically almost sure (also abbreviated a.a.s.) properties as the dependence of 
M upon n is varied. This change from a sparse graph to a dense graph, as M increases 
more quickly with n, is called the evolution of the random graph. One important property 
is the number L of vertices in the largest component of Gin, M). (If there is more than one 
component with the maximum number of vertices, we use the lexicographically first among 
largest components.) When M = cn/2 for constant c, Erdos and Renyi [3] showed that the 
number of vertices in the largest component of Q(n,M) is a.a.s. O(logn), 0(n 2 / 3 ), or 0(n) 
according to whether c < 1, c = 1, or c > 1, respectively. 

Because of this dramatic change in the structure of G(n,M), we often call M = n/2 a 
"phase transition". Further research showed that the phase transition extends throughout 
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the period M = n/2 + cn 2 / 3 for constant c in the sense that, for this range of M, L = c'n 2 ' 3 
with a distribution over the constant d . As a result, this range of M is known as the critical 
period. For s = s(n) satisfying n 2 / 3 = o(s) but s = o(n), the range M = n/2 — s is known as 
the subcritical phase while the range M = n/2 + s is known as the supercritical phase. For 
M in the supercritical phase, Q(n,M) a.a.s. has a unique largest component on (4 + o(l))s 
vertices and every other component has fewer than n 2//3 vertices. A "giant component" has 
emerged. 

Another well-studied graph property is its circumference, the length of its longest cycle. 
The circumference I of Q (n, M) also changes dramatically during the phase transition, but 
the way it changes is not entirely understood. Let u = u)(n) — ► oo. When M = cn/2 for fixed 
c < 1, the circumference of Q(n,M) is a.a.s. at most u ([6J, Corollary 5.8). In the subcritical 
phase, the circumference I a.a.s. satisfies l/u < n/s < Iuj ([IT]. Section 5.4). During the 
critical period M = n/2 + 0(n 2 ^ 3 ) it a.a.s. satisfies l/u < n 1 / 3 < lu ([UJ, Section 5.5). 
But for larger M there are not such good estimates for the circumference. (Of course, when 
M = n(\ogn + log log n + u)/2 the circumference is a.a.s. equal to n as the graph is a.a.s. 
Hamiltonian [14J.) When M = cn/2 for fixed c > 1, there are several known a.a.s. lower 
bounds on the circumference of the form (f(c) + o(l))n [H [7J [TO]. One of the earliest and 
most significant breakthroughs was given by Ajtai, Komlos and Szemeredi [I], who also showed 
an equivalence between the problems of finding paths of length (f(c) + o(l))n and finding 
cycles of length (/(c) + o(l))n. Luczak [16] has shown that the circumference of Q(n,M) is 
a.a.s. between (16/3 + o(l))s 2 /n and (7.496 + o(l))s 2 /n when M = n/2 + s for s = o{n) 
and v?l 3 = o(s). Kim and the second author [13] have improved Luczak's lower bound to 
(Q + o(l))s 2 /n. 

In this paper we improve upon Luczak's upper bound as follows. 

Theorem 1 Let M = n/2 + s with n 2 / 3 = o(s) and s = o(n). The circumference ofQ(n, M) 
is a.a.s. at most (6.958 + o{l))s 2 /n. 

In proving his result, Luczak focused on the core and kernel of Q(n,M). The core of 
a graph is its maximal subgraph of minimum degree at least 2. The prekernel of a graph 
is obtained from the core by throwing away any cycle components. The kernel of a graph 
is obtained from the prekernel by replacing each maximal path of degree-2 vertices by a 
single edge. We say that a graph is a prekernel (respectively, a kernel) if it is the prekernel 
(respectively, kernel) of some graph. 

Luczak's main insight was that, for this range of M, the kernel is much like a random 
3-regular graph, and the core is much like the graph formed from the kernel by randomly 
subdividing its edges about (8 + o(l))s 2 /n times. A random 3-regular graph a.a.s. contains 
a Hamilton cycle. This gives a cycle in Q(n,M) containing about (2/3) x (8 + o(l))s 2 /n = 
(16/3 + o(l))s 2 /n vertices of the core. This was Luczak's lower bound on the circumference. 
His upper bound came from viewing the core as constructed from the kernel together with 
a sequence of numbers, summing to (8 + o(l))s 2 /n, describing how many degree-2 vertices 
belong on each edge of the kernel. From probability theory, the sum of the largest two-thirds 
of the terms of such a random sequence is at most (7.496 + o(l))s 2 /n. 

We obtain our result by a different, more detailed study of how a cycle can pass through 
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such a structure. Our main tool is the kernel configuration model, introduced in [18] to 
facilitate arguments like Luczak's. 

Following Luczak's example, it is helpful to put weights on the edges of the kernel; the 
weight of an edge tells us how many times the edge should be subdivided to recover the 
core. These weights form a random sequence whose asymptotic properties we investigate in 
Section 2. In particular, we show that any bounded number of terms in such a sequence 
behave like independent random variables with exponential distribution. We also show that 
when a function of a bounded number of these terms is summed over many sets of such terms, 
the result is concentrated about its expected value. These properties are needed in Section 3 
where we establish an a.a.s. upper bound on the weight of the heaviest cycle in a pseudograph 
with random edge weights. The upper bound is expressed in terms of a family of constants, 
some of which we explicitly calculate in Section 4. In Section 5 we prove an a.a.s. upper bound 
on the circumference of a random prekernel with a degree sequence that resembles a random 
3-regular graph with subdivided edges. In Section 7 we use this result to prove Theorem 1 
after, in Section 6, establishing that the degree sequence of the prekernel of Q(n,M) indeed 
shows the required resemblance. 

2 Random sequences 

Let fl be the probability space, equipped with the uniform distribution, of all sequences of m 
positive integers (X 1 ,X 2 , . . . , X m ) summing to N. We are interested in the asymptotic value 
of certain functions of these random variables. Letting uo = u(N) — > oo, our asymptotics are 
in terms of N — > oo, uniformly over all m satisfying uj < m < N/uo. For the rest of the paper 
we write \x = N/m. 

Our first result tells us the expected value of certain functions of Xi,X 2 , . . . ,Xj for j 
bounded. 

Lemma 2 Let g be a nonnegative integrable function of a bounded number j of nonnegative 
variables. Suppose that for some C and d, g(xi, . . . , Xj) < G{x\ + • • • + Xj) d for all x%, . . . , Xj. 
Then, 



Since the X t are identically distributed, the above theorem also holds when {X\, . . . , Xj) is 
replaced by (X a m, . . . , X a ^) for any j distinct a(l), . . . , a(j) in {1,2,..., m}. Furthermore, 
the error represented by o(l) is independent of a. 

The next result states that when such a function is summed over a in a sufficiently rich 
family, the sum is asymptotically almost surely (a.a.s.) concentrated about its expected value. 

Lemma 3 Let f be a nonnegative integrable function of a bounded number k of nonnegative 
variables. Suppose that for some C and d, f(xi, . . . , < C(x\ + ■ ■ • + Xk) d for all Xi, . . . , x^. 
Define the constant 




Xj dxi ■ ■ ■ dxj + o(l) 



E* : 




• dxk- 
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and assume E* > 0. Let S be a set of k-tuples with entries from {1, 2, ... , m}, with each k- 
tuple having distinct components. Let I = I(S) G S x S be the pairs of tuples which intersect; 
that is, 

I = {(a,r)eSxS | {(7(1), . . . , a(k)} n {r(l), . . . , r(fc)} ^ 0}. 
//|/| =o(|,S| 2 ) then 

J2f(^,...,^)=(E* + o(l))\S\ 

a.a.s.; that is, with probability 1 — o(l). Furthermore, the o(l) terms may be bounded inde- 
pendently of S. 

These types of concentration results are often proved using martingales or inequalities like 
Talagrand's; however, because we are aiming for such a coarse result, a simple application of 
Chebyshev's inequality will suffice for the proof. 

2.1 Distribution of terms 

In this section we establish some preliminary results about the distribution of the positive 
terms X±, X 2 , ■ ■ ■ , Xj for bounded j. It is an exercise in basic counting to show that the number 
of sequences in f2 is It immediately follows that for positive integers t±,t2, ■ ■ ■ ,tj, the 

number of sequences in Q with X 1 — ti, X 2 — t 2 , . . ., Xj — tj is 



B(t) 

where t — t± + t 2 + • • • + tj. 



N-l-t 
m — j — 1 



Proposition 4 Let x satisfy x < y/m/u and x < fi/u. For positive integers t < xjj, we have 

^ = (l + 0(C 1 ))^e-^. 

Proof. 



B(t) fN-l-t\{N-l x ~ 



\Q\ \m — j — 1/ \m — 1 

- fri^fn^f 



N-i I \ LL N -t-m + i 
i=i ' x i=i 

m—l 



= (fi (l-^-))^'(l + 0(m- 1 ) + 0(t/iV) + 0(m/iV)). 
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Since m > u>, x < m/uj and m < N/u, the error term is 0(l/u). Also 



N-i N\ \N 

= l-^l + OOi" 1 )); 



m—l 



J]Yl__J_Jj = e -^(l + 0(tm 2 /iV 2 + mt 2 /iV 2 )) 

= e-*^(l -O^- 1 )). | 

Corollary 5 Lei x > fre /ixec?. For any positive integers ti, t 2 , . . . , tj summing to t < xfi we 
have 

P[Xi = ti,X 2 = t 2 , . . . ,Xj = tj] = (1 + ©(a;" 1 ))//-^-*^. 
Next we bound the probability of larger terms. 
Lemma 6 Let x > be fixed. For positive integers t\, t 2 , . . . , tj summing to t > xfi we have 



P[Xx = ti, X 2 = t 2 , . . . , X 3 = tj] < 2^e~ x \l 
when N is sufficiently large. 

Proof. If B(t) =0 then the required probability is zero and we are done. Otherwise, B(i) is 
nonzero for all positive integers i < t and the probability which we must estimate is 

B «> - m- B (M) n m 



\n\ 1 1 VL 11 B(i - 1) 

By Proposition SI the product of the first two terms is (1 + 0{oj~ 1 )) e~\- x ^i V* . This is less 
than 2\i~ie~ x when N is sufficiently large. To bound the remaining product, we estimate the 
ratio 

Bjj) (m-j-l) 

B(i - 1) ~ fiv-i-i+iN 

m — j — 1 



= 1 - 

< 1 

< 1 



N-i 
m — j — 1 

N 
m/2 



A 7 

where the last inequality holds for iV sufficiently large. So, for N sufficiently large, 

n < f 1 -^) ^ I 1 

= 1 

The result follows. 



, B(i-1) V 2^/ ~ V 2// 
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2.2 Proof of Lemma [2] 

By the definition of expected value, we have 

g(^,...,^)}=J29(-,..., t Ap[X 1 = t 1 ,X 2 = t 2 ,...,X J =t J } 



E 



where the sum is over all positive integer j-tuples t±, t 2 , ■ ■ ■ , tj. 

Fix x > 0. Let us split the sum into two parts, Si(x) being the sum over j-tuples where 
each ti < xfi, and S 2 (x) being the remainder. We will show that, as iV — > oo, 



Si(x) — > / • • • / g(xi,...,Xj)e Xl x ' 2 " Xj dx l ■ ■ ■ dxj, 
Jo Jo 

while 

\S 2 (x)\ < Ke~ x/2 

for some constant K. As x grows, | £2 0*01 approaches and S\{x) is nonnegative and nonde- 
creasing since g is nonnegative. So, taking x — > 00 proves the lemma. 

We begin by estimating Si(x). These terms have each tj < xfi, so we use Corollary [5] to 
estimate the probabilities as follows. 

= E ••• E g(-,---, t Ap[Xi = t 1 ,x 2 = t 2 ,...,x j = t j ] 

tl<X[l tj<Xfl ^ ^ ^ ' 

= E ••• E •"(- • ocw- 1 ))^^ 4 -^- 

t\<XfJL tj<XfJL ^ ^ A* / 

Since Oiu^ 1 ) is independent of the ti, this becomes 



:i + o( W -))E-E»(^-.|)f 



-3 e -(*l+-+*i)/M_ 



tl <Xfl tj <Xfl 

Letting M = x/i we get 

(i +0( O)E---E 9 («.£.....*,£)^™»(^ 

t\<M tj<M 

As iV -> 00 we have M — > 00 and this expression becomes the Riemann integral 

g(x 1 , . . . , Xj)e~ Xl ~ X2 Xj dx 1 ■ ■ ■ dxj 



as required. 

The terms of the sum S 2 (x) are indexed by j-tuples t 1 ,t 2 , . . . , tj with at least one ti > x\i. 
Consider such a term, and let t — t\ + t 2 + • • • + tj. For N sufficiently large, the absolute 
value of the term is 

9 £• ■ • • • t i) P|A '' = «" x -- = ^ ■ ■ ■ • x > = ^ < c © ' i 1 - £ 



by the hypotheses about g and Lemma El The number of terms in £2(2;) indexed by j-tuples 
summing to t is at most < (t + < (2t) J-1 for A" (and hence t) sufficiently large. 

Thus, for iV large, we have 



\s 2 (x)\ < J2( 2t ) j ~ lc (l) 2 » 



^' 1 1 - 2, 



2 \ t—XjX 



The factor (1 — l/(2/i)) _x/i approaches e x ^ 2 as N ^ oo. The remaining sum is 

E^-'O-^)' s E('+i)('+2)... (i+ ,- +(i -i)(i-^)' 

= (j + d-l)!(2/^' +d 

using the Maclaurin series expansion k\(l — x)~ k ~ 1 = ^ t> Q{t+l){t+2) ■ ■ ■ Combining 
this with the previous results, we get the desired estimate. This proves the lemma. | 



2.3 Proof of Lemma [3] 

For each a in S, define the random variable Y a := f(X a ^/fi, . . . ,X a ^/fi). As we remarked 
after Lemma [2], each of these variables has the same distribution as the random variable Y\ : = 
f[Xxj\i, . . . ,X k /fi). In particular, the expected value is the constant E*, up to an additive 
error of o(l). We will establish the concentration of the random variable Z := Ylaes^ 
by showing that the variance V[Z] is o((EZ) 2 ). The lemma then follows by Chebyshev's 
inequality. 

We begin by estimating 

(ez) 2 = ey;ey; 

(<r,r)esxs 

= £ (E* + o(l))(E* + o(l)) 

(u,t)£SxS 
(<7,r)eSxS 

= &(\s\ 2 ) 

(using the lower bound assumed on E* in the lemma). We can write the variance as 
V[Z) = E[Z 2 ]-(EZ) 2 

= £ (E[Y a Y T ]-EY a EY T ) 

(u,t)£SxS 

= Yl ( E [ Y M - Ey ff Ey T ) + Yl i E l Y ° Y r] ~ EK.EK). 

((7,r)e/ ((T,t)€{SxS)\I 
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To study the terms of the second sum, let (a, r) 6 (5 x 5) \ /. By Lemma El we have 
E[Y a Y T ] 



E 



/ 



, . . . J 

/i /i 



./ 



X 



r(fe) 



, . . . , 



oo />oo 



/(xi, . . . , x k )f(x k+1 , . . . ,x 2 k)e Xl "' X2k dxx ■ ■ ■ dx 2 k + o{\) 

(poo roo \ 2 

/ •••/ /(x 1 ,...,x fc )e- Xl - a;2 --^ C /x 1 --- C /x fc J +o(l) 

EY a EY T + o(l) 



where o(l) is independent of cr and r. So the second sum is od^l 2 ). To study the terms of 
the first sum, we can be more crude. By Lemma [2] and the remark following it, we know that 
each EfY^y,-] and EY^EY,- depends only on the tuple positions where a and r intersect, and 
each value is 0(1). So the first sum is 0(|/|), which is odS 1 ) 2 ) by hypothesis. Combining the 
two sums, we see that the variance of Z is od^l 2 ), which is o((EZ) 2 ), as required. | 

3 Heavy cycles in a weighted pseudograph 

In the introduction we saw that the problem of bounding the circumference of Q(n,M) is 
connected to the problem of bounding the weight of the heaviest cycle in a certain edge- 
weighted graph. In this section we study a graph, technically a pseudograph since it may 
have loops and/or multiple edges, whose m edges are randomly weighted by positive integers 
summing to N. The sequence of weights is chosen uniformly at random from among all 
such sequences. Equivalently, we can think of the weights as being generated by the following 
random process applied to make a sequence of pseudographs, beginning with the given one. At 
each step, choose an edge uniformly at random from the current pseudograph and subdivide 
the edge into two edges. Repeat the procedure until the resulting pseudograph has exactly 
N edges. For each edge in the original pseudograph, define its weight to be the number 
of edges into which it has been subdivided. These weights form a sequence of m positive 
integers summing to N. There are exactly (N — m)\ ways that the process can form a given 
sequence, so the sequence is chosen uniformly at random from among all such sequences. 
Another random process for generating the weights initially gives a weight of 1 to each edge, 
then selects an edge at random with probability proportional to the weight of the edge and 
increments the weight of the selected edge by 1. The selection and incrementing is repeated 
until the total weight is N. It is easy to see that this process is equivalent to the previous 
one. 

Given a subgraph of an edge-weighted pseudograph, we define the weight of the subgraph 
to be the sum of the weights on its edges. To establish an upper bound for the weight of a cycle 
in a large pseudograph, we will consider the intersection of the cycle with small trees in the 
pseudograph. The intersection of the cycle and the small tree will form a set of vertex-disjoint 
paths which begin and end at leaf vertices of the tree. We will use the maximum-weight set 
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of such vertex-disjoint paths to bound the weight of the intersection. This motivates the 
following definitions. 

Fix an integer k > 2. A biased tree T on k edges is a tree on k edges with each non-leaf 
vertex having degree 3 and each edge having a nonnegative number bi called its bias. We 
may assume that the sum of the biases b = (61, b 2 , • • • , b k ) is 1- 

Let V be the set of all maximal subgraphs of T which are a union of vertex-disjoint paths 
which begin and end at leaf vertices. Define the function 

f T (xi,x 2 , ■ ■ ■ ,x k ) = max ^ biXi 

i: ei eE(P) 

and the constant 

/>oo />oo />oo 

£ T = / / •••/ f T (x 1 ,X2,...,x k )e- Xl - X2 —- Xk dx 1 dx 2 ---dx k . (3.1) 
Jo Jo Jo 

If Xi,X2, . . . ,x k are weights on the edges of T, we can think of / as the maximum "biased 
weight" of any graph in V . 

We say that the positive constant c* is k-admissible if E T < c* for some biased tree T on 
k edges. 

Lemma 7 Fix an integer k > 2. Let the positive number c* be k-admissible. Let G = 
G(n) be a pseudograph on v = v(n) ^00 (as n 00) vertices and m = m{n) edges with 
minimum degree at least 3. Suppose the subgraph B of G induced by cycles of length at most 
k (including loops and parallel edges) and edges incident to vertices of degree greater than 3 
satisfies \E(B)\ = o(v). Let N = N(n) be a positive integer satisfying m = o(N). On the 
edges of G put weights, a sequence chosen uniformly at random from among all sequences of 
m positive integers summing to N . Then, the heaviest cycle in G has weight a.a.s. at most 
c*N. 

Proof. Denote the edges of G by w±, w 2 , ■ ■ ■ , w m and their random weights by Xi,X 2 , ■ ■ ■ , X m . 
We estimate m by recalling that in any graph the sum of the vertex degrees equals twice the 
number of edges. Since G has minimum degree at least 3, we have 2m > 3v. Since G has 
only o(v) edges incident to vertices of degree greater than 3, we have 2m < 3v + o(v). Thus 
m ~ 3f/2. 

For a subgraph S of G, define its k -neighbourhood T(S) to be the subgraph of G reachable 
from S by paths of length at most k. Recalling that the subgraph B of G contains all edges 
incident with vertices of degree greater than 3, its /^-neighbourhood satisfies \E(T(B))\ < 
2 k \E{B)\= o{v). 

Let C be a cycle in G. Its weight wt(C) is 

m 

wt(C) := ^I 3 /Ke£(C)) 

E X j I(w j eE(C))+ £ X 3 I(w 3 eE(C)) 

j: Wj eE(T(B)) r.w^E{Y(B)) 
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where 1(a) is the indicator function equal to 1 if a is true and otherwise. The expected 
value of each Xj is /i — N/m, so the first sum has expected value at most \E(T(B))\N/m = 
o(vN/m) = o(N) since m ~ 3v/2. It follows by Markov's inequality that the first sum is 
a.a.s. o(N). Thus a.a.s., 

wt(C)=o(N)+ Yl X 3 I{w 3 eE{C)). (3.2) 

j:w^E{T{B)) 

Since c* is A;-admissible, there is a biased tree T on k edges with < c*. We will study 
the copies of T in the graph G\B. Let S be the set of 1-1 homomorphisms a mapping T to 
G\ B. Since k > 2, each a is uniquely defined by the mapping it induces between the edge 
sets. We write a = (er(l), <r(2), . . . , cr(k)) and interpret a(i) = j to mean that a maps edge e, 
of T to edge Wj of G. 

Consider the random variable 

k 

o-es i=i 

Expressing Z in terms of the edges of G we may write 

m k 

z = EEEwk^(W)=j) 

j = l aeS 1=1 
m fc 

i=l i=l 

For each edge iUj of G not in E(T(B)) the /c-neighbourhood of u»j is the depth-/c tree with 
internal vertices of degree 3. Thus, \{a e 5 | = j}| equals some constant independent 
of j. In fact, this constant is a number a, independent of i, because any a in this set is 
determined by choosing one of the 2 ways to embed onto Wj and then, moving outward 
from ej, making one binary choice for each non-leaf vertex of T. On the other hand, for an 
edge Wj G E(T(B)), \{a G S \ a(i) = j}\ is at most a (by the same argument, recalling that 
some choices are impossible because a maps into G \ B), so we have 

k 

z = Yl E E a WK-G£(co) 

j:^-6E(r(B)) j: Wj gE(T(B)) i=l 

E °(^')+ E aXjIiwj E E(C)) 

j: Wj £E{r(B)) j:w^E(Y(B)) 

since 

E,-=i6< = 1- The first sum has o(v) terms, each having expected value 0(N/m), so the 
sum is a.a.s. o(vN/m) = o(N) by Markov's inequality. We now have a.a.s. 

Z = o(N) + a XjI( Wj E E(C)). 

j<£E{T{B)) 
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Combining this result with (13.21) we get a.a.s. 

wt(C) = ^Z + o(N). (3.3) 

Returning to the definition of Z, we notice that the inner sum is the "biased weight" of the 
edges of C passing through the copy of T given by a. These edges must form vertex-disjoint 
paths beginning and ending at leaves of the copy of T given by a, so this sum is at most 
fr (^ct(i) , X a (2) , X<r(k) ) • So 

(i),X a (2), . . . , X a (ky). 



We will estimate this sum by applying Lemma [3] to 

- E fT(Xa(l), X a (2), -Xa(fc)) = E f T ( 



X a (l) X a (2) X a {j.) 



To verify the hypotheses of Lemma [3] we first note that fr{xi, %2, ■ ■ ■ , Xk) is nonnegative, 
piecewise linear (and hence integrable), and bounded above by Xi + x 2 + - ■ ■ + We estimate 
\S\by 

\S\ = E |{^ e 5 | o-(l) = j}| 

j: W] dE(G) 

E l{^S|a(l)=j}|+ E l{^|<x(l)=j}| 

i Mj eB(r(B)) j:io 3 -gE(r(B)) 

= E a 

i^£(r(B)) 

= o(v) + (1 + o(l))ma 
~ ma 

using o(f) = o(m). To estimate the cardinality of the set I of pairs (a, r) G S x S for 
which cr and r represent intersecting copies of T, consider any edge / in G \ B. As we have 
seen previously, there are at most a copies of T using /. So, a crude upper bound for |/| is 
a 2 \E{G \B)\ < a 2 m, giving us |/| = odS 1 ) 2 ) as required. Recalling the definition of Et from 
(13.11) we may apply Lemma [3] and conclude a.a.s. 

-Z < (E T + o(l))\S\ 
= (E T + o(l))am. 
Combining this with Equation (13.31) we get a.a.s. 

wt(C) < -fi(E T + o(l))am + o(N) 
a 

= (E T + o(l))N + o(N) 
< c*N 
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as required. | 

Remark 1. A random 3-regular graph a.a.s. satisfies all of the hypotheses of Lemma [3 
The lemma thus gives an upper bound which holds a.a.s. on the weight of the heaviest cycle 
in a randomly-weighted random 3-regular graph. 

Remark 2. There are essentially two ingredients in the proof of Lemma The first 
ingredient is a method for bounding the weight of a cycle in a large edge-weighted 3-regular 
subgraph. The second ingredient is the argument that the weight of the heaviest cycle does not 
change much when the remainder of the graph is included. This second ingredient is implicit 
in Luczak's proof of his upper bound on the circumference of Q(n,M) in the supercritical 
phase |16j . It is the first ingredient that is the new contribution. 

Remark 3. For the task of bounding the weight of a cycle in a large edge- weighted 
3-regular subgraph, one might suggest investigating the weight of the least-weight matching. 
Certainly the complement of a Hamilton cycle in a 3-regular graph forms a perfect matching. 
But, in general, the maximum- weight cycle is not necessarily Hamiltonian. Thus, its removal 
from the graph does not always form a perfect matching. 



4 Computing Et 

Recall the definitions of /t(x) = fa{xi,x 2 , • • • , %k) and Et from (13.11) . In the previous section 
we saw that the weight of the heaviest cycle in a certain edge-weighted pseudograph can be 
bounded in terms of Et for any biased tree T. In this section we compute the value of Et 
for a few specific biased trees T. For some trees T we also state the biases b which make Et 
as small as possible. 

Proposition 8 Let T be the biased tree on two degree-3 vertices and four leaf vertices with 
bias b on the edges incident to leaves and bias 1 — 46 on the remaining edge, where b is the 
unique zero of 1056 3 - 90b 2 + 246 - 2 on < b < 1/4. Then 

_ 4(1 - 3&)(56 2 -56+1) 
T ~ (76-2) 2 

which lies in (0.8797, 0.8798) and hence c* = 0.8798 is k-admissible for k = 5. 

It can be shown that, for this tree, no other choice of biases b yields a lower value of Et- 
See [12J for details. 

Proof. We begin by letting v and w denote the two non-leaf vertices of T. Let e% and C2 
denote the two edges which are each incident to v and a leaf. Denote by e% the edge joining 
v to w, and christen the other two edges as e 4 and e$. Under this ordering the biases are 
b = (6,6,1-46,6,6). 

To evaluate the integral Et we exploit some of its symmetry. It suffices to integrate over 
only nonnegative Xi, x 2 , x 3 , x 4 , x 5 satisfying X\ < x 2 and a; 4 < x 5 and multiply the final result 
by 4. For such points, only two of the P G V can attain the maximum in the definition of 
fa, giving us 

/t(x) = fa{x\, x 2 , x 3 , x 4 , x 5 ) = max{6xi + bx 2 + bx 4 + 6x 5 , bx 2 + (1 - 46)x 3 + 6x 5 }. 
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We split the region of integration into two parts, according to whether 

bxi + b%2 + b%i + bxn > 6x 2 + (1 — 46)23 + bx$, 
i.e. b{x\ + X4)/(l — 46) > x 3 . The integrals are 



1-46 



£2 /"£5 

6(xi + x 2 + £4 + x 5 )e~ xl ~ X2 ~ X3 ~ X4 ~ X5 Gfei<i£ 2 • • • dx 5 

X2=0 J £5=0 J £1=0 J £4=0 J £3=0 

and 

pco pco PX2 rx$ poo 

/ / / / / (6x 2 + (1 - 46)x 3 + bx 5 )e- Xl - X2 - X3 ~ X4 - X5 dx 

Jx 2 =0 Jx 5 =0 Jx 1= Jx 4 =0 Jx 3 = Kx ^ b 4) 

which, when evaluated, added together, and multiplied by 4, give us 

4(1 - 36)(56 2 - 56 + 1) 
T ~ (76 - 2) 2 ' 

The result follows by simple computations. I 

Proposition 9 Let T be the biased tree on three degree-3 vertices and five leaf vertices with 
bias 6 on edges incident to leaves and bias (1 — 5b)/ 2 on the other edges, where 6 is the unique 
zero of -39936 4 + 27656 s + 145 26 5 - 8046 2 + 1056 - 5 on < 6 < 1/5. Then 

7266 4 - 60 16 3 + 24 56 2 - 556 +5 
T ~ 5(l-6)(46-l) 2 

which lies in (0.8741, 0.8742) and hence c* = 0.8742 is k- admissible for k = 7. 

For the tree in the above proposition, it can be shown [12] that no other choice of biases b 
yields a lower value of Et- 

Proof. We may view T as the complete binary tree on six edges with one additional edge 
e\ joining the root to an additional vertex. Denote by e 2 and e% the other edges incident to 
the root. Denote by e 4 and es the edges incident with e 2 . Denote by and the two edges 
incident with e%. Under this ordering, the biases are 

1-56 1-56 



, 6, 6, 6, 6 

By symmetry we may compute Et by integrating over only X4 < X5 and x§ < xj and 
multiplying the final result by 4. In this range, fx is the maximum of four expressions, 

1. 6(x 4 + £ 5 + x e + £ 7 ), 

2. 6x1 + (1 - 56)x 2 /2 + b(x 5 + x 6 + x 7 ), 

3. (1 - 56)(x 2 + £3)/2 + 6(x 5 + £ 7 ), and 
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4. bx! + (1 - 56)rr 3 /2 + 6(x 4 + x 5 + x 7 ). 

To compute the integral, the region of integration is divided into four parts, according to 
which of the above expressions gives the maximum. We present the details for the first part 
only. 

The first expression exceeds the other three if and only if bx 4 > bxi + (1 — 5b)x 2 /2 and 
bx Q > bxi + (1 — 5b)x 3 /2. To express the integral over this part as an iterated integral, we 
divide the part into two regions, according to whether x 4 > x 6 or not. The region on which 
x 4 > xq gives the integral 

,-OC 0X4, PX 6 r \i 5b r ^ noo rOO 

III / / / ldx\dx2dx 3 dx 4 dx^dx^dx-j 

J X4=0 J Xe=0 Jxi=0 J X2=0 J X3=0 J X$=X4 J X~i=X(, 

where the integrand is 

/ = b{x 4 +X 5 +X 6 + X7 ) e -*i-*'-**-**-**-**-*7 

which evaluates to 

1 (736 - 17)6 3 



3 



100 (46 - 1) 
The region on which x 6 > x 4 gives 

1 (736 - 17)6 3 



100 (46-l)(166 2 - 86 +l) 

The other three parts can be expressed and evaluated similarly, giving a final result of 

7266 4 - 6016 3 + 24 56 2 - 556 + 5 
T ~ 5(1 -6) (46- l) 2 ' 

The result follows by simple computations. | 

Our final computation is for a nine-edge tree. Its lengthy proof uses the same method 
that we used in the previous computations, so we omit it. 

Proposition 10 Let T be the biased tree on four degree-3 vertices and six leaf vertices with 
bias 6 on edges incident to leaves and bias (1 — 66)/4 on the other edges, where b is the unique 
zero of 

23 728 9 56 6 - 30 1 32 06 5 + 150 14 1 66 4 - 38 9 2 3 26 3 + 560 1 66 2 - 42 2 46 + 128 

on < 6 < 1/6. Then 

-2(-128 + 24 4 86 - 1785 66 2 + 60 3 726 3 - 88 9 386 4 + 3766 56 s ) 
T ~ 9(32 - 60 06 + 42 126 2 - 13 1 226 3 + 153 96 4 ) 

which lies in (0.8696, 0.8697) and hence c* = 0.8697 is k-admissible for k = 9. 

Computer simulations suggest that the value of c* decreases only slightly as k is increased 
further so we do not pursue this here. 
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5 Circumference of a random prekernel with given de- 



gree sequence 

In the previous sections we have established Lemma [7J an a.a.s. upper bound on the weight 
of the heaviest cycle in certain randomly-edge-weighted pseudographs. In this section we use 
that lemma to establish an upper bound on the circumference of a random prekernel whose 
degree sequence satisfies certain conditions. In later sections we will see that the degree 
sequence of the prekernel of Q(n,M) a.a.s. satisfies these conditions, allowing us to use this 
result to establish an a.a.s. upper bound on the circumference of the prekernel of Q(n, M). 

One of the challenges in this section arises because Lemma [7] is a statement about non- 
random pseudographs with random edge weightings, while we are proving a statement about 
random prekernels. The kernel configuration model of Pittel and Wormald, described below, 
allows us to rigorously make this transition. It combines a pairing model, for generating the 
kernel, with a random sequence of weights on the kernel edges. 

Another challenge in this section is to show that the conditions on the degree sequence 
imply that the hypotheses of Lemma [7J are satisfied. One hypothesis requires that there are 
few edges incident with vertices whose degree exceeds 3. Another hypothesis requires that 
the number of short cycles in the kernel be small. In Luczak's proof of his upper bound for 
the circumference of Q(n, M) in the supercritical phase, he established the first hypothesis by 
direct enumeration over degree sequences. (See the proof of Theorem 10 in [IS].) However, 
Luczak does not require the second hypothesis, so we will need to prove it here. We will 
see that, without much extra effort, our proof of the second hypothesis gives an alternative 
derivation of the first hypothesis. In [5] and [19] there are results about short cycles arising in 
this pairing model. However, these results apply only when the maximum degree is bounded, 
so they cannot be used for our application. 

We are interested in studying prekernels with a given degree sequence d = (dj). We say 
that d is a prekernel degree sequence if its number of terms v = f(d) is finite, each term is a 
positive integer at least 2, and r = r(d) = ^(dj — 2) is even. For j = 2, 3, ... we define 



The kernel configuration model 7~t{d) is used to generate prekernels with degree sequence 
d. It has been used successfully to calculate improved estimates for the size of the core, 
excess, and tree mantle [18]. We describe the model next. 

For each i with dj > 3 create a set Si of d, points. Let V be the set of perfect matchings 
on the union of these sets of points and choose P G V uniformly at random. Then, assign 
the remaining numbers {i : dj = 2} to the edges of the perfect matching and, for each edge, 
choose a linear order for these numbers. The assignments and the linear ordering, denoted 
by /, are chosen uniformly at random. The pair (P, /) defines a random configuration in the 
model H(d). 

Each configuration (P, /) corresponds to a prekernel G(P, f) by collapsing each set Si to a 
vertex (producing a kernel K(P)) and placing the degree-2 vertices on the edges of the kernel 
according to the assignment and linear orderings. 
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Lemma 11 Let d = d(n) be a prekernel degree sequence satisfying v = v(d) — > oo ; r = 
r(d) — ► oo ; r = o{v), D 3 = D 3 (d) ~ r, and 

E (?) < 

i:di>3 V 7 

Fix a positive integer k > 2 and suppose that the positive constant c* is k- admissible. For a 
random configuration (P, f) in Ti{d), the longest cycle in G(P, f) has length a.a.s. at most 
c*v as n — > oo. 

Proof. Define V* to be the set of P G V for which K(P) has at most y/r edges in cycles of 
length at most k. We will show that a random configuration (P, f) a.a.s. has P G V*. Recall 
that P is a random perfect matching on the points in the union of the Si. For j G {1,2,..., k}, 
the number of ways of choosing j pairs of points to form a cycle is at most 




The probability that j given pairs of points appear in the pairing P is asymptotic to 

i:di>3 

since j is bounded. Now 

i:di>3 i:di>3 

= r 

so the expected number of cycles of length j is 0{r^r~^) = 0(1). Since is fixed, the expected 
number of edges in such cycles is also 0(1). By Markov's inequality, the number of edges 
in cycles of length j is a.a.s. bounded above by any function uo = uo[n) — > oo, in particular 
s/r/k. Thus, a.a.s. P G V* . 

Let (P, /) be a random configuration from TC(d). Define G'(P, f) to be the edge-weighted 
pseudograph whose underlying pseudograph is K(P) and whose edge-weight on e, for each 
edge e, is one more than the number of vertices assigned to e by /. Let A be the event that the 
heaviest cycle in G'(P, f) has weight at most c*v. Let Pq be the P* minimizing P[A | P — P*] 
over P* G V*. The minimum exists because V* is finite. Next we verify that, conditioned on 
P = Pq, G'(P, f) satisfies the hypotheses of Lemma [71 The number of vertices v' of G'(P, f) 
is at least D% ~ r — ► oo. The minimum degree is at least 3 because it is a kernel. The number 
of edges incident to cycles of length at most k (including loops and parallel edges) is at most 
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y/r = o(r) = o(v') since Pq G V*. The number of edges incident to vertices of degree greater 
than 3 is at most 

J2 dj < 2 £(d,--2) 

= 2J2(dj-2)-2D 3 

3 

= 2r-2D 3 
= o{r) 

which is o(v'). The number of edges m' satisfies 

2m' = d 3 

j:dj>3 

= 3D 3 + o(r) 

by the previous calculation, so m! = 0(D 3 ) = 0(r) = o(v). To see that m! is little-oh of 
the sum N of the edge- weights, observe that N is the number of edges of G(P, /), which is 
J2j dj/2 > v. Next observe that the edge weights form a sequence of positive integers that is 
determined by the assignment / in the random configuration. There are exactly \{i : di — 2}|! 
choices for / that produce any given sequence, so the sequence is chosen uniformly at random. 
We have shown that the hypotheses of Lemma [7] hold for G'(P, f) conditioned on P = P , so 
we have P[A \ P = P ] = 1 - o(l). Now 



P[A] > P[A\P = P*}P[P = P* 

> P[A | P = Pq] P [ P = P * 



by the choice of Pq. Since we showed P G V* a.a.s. we get P[A] = 1 — o(l); that is, the 
heaviest cycle in G'(P, f) a.a.s. has weight at most c*v. But if C is a cycle in G(P, f) of some 
length I, C corresponds naturally to a cycle in G'(P, f) of weight I. So the longest cycle in 
G(P, f) a.a.s. has length at most c*v. I 

Corollary 12 Let d = d(n) be a prekernel degree sequence satisfying v = v(d) — > oo, r = 
r(d) — > oo, r = o{v), D 3 = D 3 (d) ~ r, and 

'di 



^ \2 

i:di>3 



< 4r. 



Fix k > 2 and suppose that the positive constant c* is k- admissible. Let G be chosen uniformly 
at random from all prekernels with degree sequence d. The longest cycle in G has length a.a.s. 
at most c*v as n — ► oo. 

Proof. The probability space T~t(d), conditioned on the event that G(P, f) is a simple graph, 
is a uniform probability space on the prekernels with degree sequence d ([18], Lemma 3). By 
Lemma 5 in [18], G(P, f) IS cL.cL.S. cL simple graph. (In fact, Lemma 5 in [18] is stated with 
an additional hypothesis on maxdj, but this hypothesis is not used in the proof.) The result 
now follows from Lemma [TT] | 
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6 Truncated multinomial distribution 



In order to apply Corollary [12] to the prekernel of Q(n,M), we must verify the hypotheses 
about properties of the degree sequence. We give a new derivation of these properties, which 
will require some facts about the following distribution. 

Let v and t be positive integers. The probability space Multi(t>, t) consists of vectors 
(di, d 2 , . . . , d v ) with distribution 

t\ 

P[di =ji,d 2 = j 2} ...,d v = j v ] 



for any vector (ji,j 2 , ■ ■ ■ ,jv) of nonnegative integers summing to t. This is the well-known 
multinomial distribution, modelling the number of balls in each bin when each of t balls is 
tossed into one of v bins, independently and uniformly at random. The space Multi(v , t)\> 2 
is obtained from Multi(v, t) by conditioning on the event that each di > 2. 

Lemma 13 Let v = v(n) and r = r(n) satisfy v — > oo, r —>■ oo and r = o(v). If the random 
vector d is distributed as Multi(i>, 2v + r)\> 2 then a.a.s. D 3 (d) ~ r and 

E (f) < 

i:di>3 V / 

Proof. Define the positive number A by 

A(e A -l) | r 
e x — 1 — A v' 

In |3j, the authors show that A exists and they use it to define a vector of independent 
truncated Poisson random variables which approximate Multi(v, 2v + r)|>2 as follows. Define 
the random variable Y taking values j — 2, 3, . . . according to the distribution 

X j 

P[Y = j]= Pj 



jV-i-a)" 

Consider the probability space formed by vectors Y = (Yi, Y 2 , . . . , Y v ) of v independent copies 
of Y and let S be the event that their sum satisfies ^ f Yi = 2v + r. For nonnegative integers 
ji,j 2 , ■ ■ ■ ,j v summing to 2v + r with each j% > 2 we have 



py.=A.-.y. = A,] = (e> ^A). n^i 



so this probability space, conditioned on E, is identical to Multi(t> , 2t> + r)|> 2 . Equation (5.7) 
in [18] says that 



maxYj < logf or | * j > 4r | E 

* i:Yi>3 ^ ' 



Ofr 1 + rv 1 
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The second claim in the lemma follows. Theorem 4(a) in [17J states that for r — > oo, 

P[E] 1+ ° (r " ) 



y/2-KVc(l + Tj — c) 

where c(l + — c) ~ c — = r/t> by Equation (20) in [T7]. It follows that 

Pp]- 1 =0(VF). (6.1) 

To establish the first claim in the lemma, we observe that D 3 (Y) is distributed as a binomial 
random variable with v trials and p 3 probability of success. By Chernoff 's bound, 

P [\D 3 (Y) - vp 3 \ >a]< 2exp(-a 2 /{3vp 3 )) 

for < a < vp 3 . Recalling (16. ip . in Multi(f , 2v + r)|>2 we have 

P [|£> 3 (d) - vp 3 \ >a] = 0{y/r) exp(-a 2 /{3vp 3 )). 

Setting a = ^/vp 3 log r (which satisfies a < vp 3 , as we will see shortly) we get D 3 (d) = 
vp 3 +0(y/vps logr) with probability 1— 0(exp(— (logr) 2 /3)). Now A ~ Srv" 1 by Theorem 1(a) 
in [T7|, so 



A :J 



vp 3 



3!(e A -l-A) 
A 3 



3!(A 2 /2 + 0(A 3 )) 
lvX(l + 0(X)) 
r(l + 0(rv- 1 )) 



giving us D 3 (d) ~ r a.a.s. as required. 



7 Properties of vertex degrees in Q{n,M) 

Now we proceed to establish the properties of the degree sequence of the prekernel of Q (n, M) 
that are required to apply Corollary [121 Recall that we are assuming M = M(n) = n/2 + s 
for some s = s(n) satisfying s = o(n) and n 2 / 3 = o(s). For this range of M, it is well-known 
that Q{n, M) a.a.s. has a unique component with maximum number of vertices [2], which we 
call the largest component. 

We begin by showing a.a.s. there are few vertices in the core that lie outside the largest 
component. The next result is part of the proof of Theorem 4 of |16j . Here we present a 
slightly more thorough proof. 

Lemma 14 Let M = M(n) = n/2 + s for some s = s(n) satisfying s = o(n) andn 2 ^ 3 = o(s). 
The number of vertices in cycles of Q(n,M) not in the largest component is a.a.s. at most 
ujnj s for any uj = u)(n) — > oo. 
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Proof. Let G be the graph formed from Q(n,M) by removing its (lexicographically first) 
largest component. Let n{G) and M{G) represent its number of vertices and edges, respec- 
tively Let e > and define S to be the set of ordered pairs (n, M) satisfying 

1. (1 - e)4s < n - n < (1 + e)4s, 

2. (1 - e)As < M - M < (1 + e)4s, and 

3. P[n(G) = n, M(G) = M] > 0. 

It is known that the largest component of M) has a.a.s. 4s(l + o(l)) vertices and 4s(l + 
o{\)) edges [2JCE5]. So a.a.s. (n(G),M(G)) G 5. For (n, M) G S we have M < M-4s(l-e) = 
n/2 + s — 4s(l — e) and n < n + 4s(l + e), giving us 



To estimate the number X of vertices in cycles in G we let (n, M) £ S* and condition on the 
non-empty event = n, M(G) = M. In the conditioned space, G is equally likely to be 

any graph on n vertices and M edges. For 3 < k < n the number of such graphs having a 
cycle of length k is at most 



M < n/2 — s(l - 6e). 



(7.1) 




so the expected value of X in this conditioned space is 



E[X | n(G) = n, M(G) = M) < 



^ \kJ2k\M-kJ\M J 



< 




2^{n-k)\ (©-M)! (M — A;)! 



2 fe((^)--) fc ■ 



If fi! (©-M-ife)! M! 



1 A n fc M fc 



Using (17.11) this becomes 



E[X\n(G)=n,M(G) = M] < \ ( I ^— 

A k=3 V 2 1 / 




A: 



1 n-3 



2 s(l-6e) 
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which is at most n/s for n sufficiently large. By Markov's inequality, 



/ , »vi E[X I n(G) = n, M(G) = M] 

V[X>uon/s | n(G) = n, M(G) = M] < — — — ' ! 



uon/ s 
1 

< — . 



So 



P[X < uon/s] > PLY < un/s, (n(G),M(G)) G S] 

= (P[X <iun/s\n(G) =n,M(G) = M] 



(n,M)GS 

xP[n(G) =n,M(G) = M]) 
1 



> M__ P[( n (G),M(G))G5] 



l-i) (l-o(i)) 

since a.a.s. (n(G),M(G)) G S. Therefore a.a.s. X < ujn/s. | 

Instead of proving results about the degree sequence of the prekernel of Q (n, M) directly, 
we will actually prove results about the degree sequence of the core. The next result will 
allow us to transfer results about the core to the prekernel. 

Lemma 15 Let M = M(n) = n/2 + s for some s = s(n) satisfying s = o{n) andn 2 ^ 3 = o(s). 
The core of the largest component of Q(n, M) is a.a.s. formed from the core ofQ(n,M) by 
removing o(s 2 /n) vertices of degree 2. Also, the prekernel of Q(n, M) is a.a.s. formed from 
the core of Q{n, M) by removing o(s 2 /n) vertices of degree 2. 

Proof. It is well-known that the largest component of Q(n,M) is a.a.s. the only compo- 
nent that has more than one cycle. (See Theorem 5.12 in [IT].) So, the core of Q(n,M) 
is a.a.s. composed of the core of the largest component together with some cycle compo- 
nents. By Lemma [TH the number of vertices in the cycle components is a.a.s. at most 
ujn/s = (s/n 2//3 )(n/s) = n 1 / 3 = (n 2//3 ) 2 /n = o(s 2 /n), since we may take uj = s/n 2 ^ 3 . Be- 
cause the largest component of Q(n,M) a.a.s. contains more than one cycle, it follows that 
these cycle components s. all of the cycle components in the core of Q(n, M), making 

the prekernel a.a.s. equal to the core of the largest component. I 

Now we establish the required properties of the degree sequence of the prekernel of Q(n, M). 
Recall the definitions of Dj(d), v(d), r(d) from (15.11) . The results about r(d), v(d), and D 3 (d) 
in the following lemma were used by Luczak [16]. He used the estimate of r(d) from [15] to 
establish estimates for Dj(d) and v(d) by direct enumeration over degree sequences. Instead 
of studying the prekernel directly, he studied the core of the largest component. Our proof 
method is different, using the known estimates of v (d) and r(d) to establish results about the 
degree sequence of the core. 
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Lemma 16 Let M = M{n) = n/2 + s for some s = s(n) satisfying s = o(n) and n 2 / 3 = o(s) 
Let d be the degree sequence of the prekernel of Q(n, M) . Then, a.a.s. v(d) ~ 8s 2 /n, r(d) ^ 
D 3 (d) ~ 32s 3 /(3n 2 ) ; and 



]T 2 (f)<4r(d). 



i:di>3 

Proof. By Lemma [151 the prekernel differs from the core a.a.s. by o(s 2 /n) vertices of degree 
2. Thus, it suffices to prove the lemma for the degree sequence d of the core. Appealing to 
Lemma [TS] again, the core differs from the core of the largest component a.a.s. by o(s 2 /n) 
vertices of degree 2. It is known [T5] that the degree sequence d' of the core of the largest 
component a.a.s. has f(d') ~ 8s 2 /n and r(d') ~ 32s 3 /(3n 2 ), so we must have a.a.s. v(d) ~ 
8s 2 /n and r(d) ~ 32s 3 /(3n 2 ) also. Letting e > 0, this means d G S a.a.s. where S is the set 
of ordered pairs (v, f) satisfying 

1. (1 - e)8s 2 /n < v < (1 + e)8s 2 /n, 

2. (1 - e)32s 3 /(3n 2 ) < f < (1 + e)32s 3 /(3n 2 ), and 

3. P[u(d) = v,r(d) = f] > 0. 

We note that for (v,f) G S we have f = o(v) since (s 3 /n 2 )/(s 2 /n) = s/n = o(l) and both 
w — > oo and f — > cxd since n 2//3 = o(s). 

To establish the remaining properties of the degree sequence of the core of Q(n, M) we use 
Theorem 2 of [3], which proves the existence of a probability space of ordered pairs (G, I) in 
which 

1. G, conditioned on the event I — 1, is distributed as the core of Q(n, M), 

2. P[J = 1] = and 

3. the degree sequence d(G) of G, conditioned on v(d(G)) = v and r(d(G)) = f, is 
distributed as Multi(t), 2v + r)|> 2 . 

(The statement of Theorem 2 of Q actually includes the hypothesis M > n which is not 
satisfied here; however, that hypothesis is not needed for their proof.) 
Write v = v(d(G)), r = r(d(G)) and let A be the event that 

1. (1 - e)32s 3 /(3n 2 ) < D 3 (d(G)) < (1 + e)32s 3 /(3n 2 ), and 

2- E^( G) >3 2(* ( 2 G) )<4r. 

To prove the lemma, we must show P[A \ I = 1] = 1 + o(l) or equivalently P[y4 c \ I = 1] = 
o(l), where X c denotes the complement of event X. We begin by writing 

P[A C | / = 1] = P[A C , (v, r) G S | / = 1] + P[A C , (v,r)#S\I=l]. 
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The second term is at most P[(v,r) G" S \ I = 1] which is o(l) because d(G), conditioned on 
I — 1, is distributed like the degree sequence of the core of Q(n, M). We write the first term 

as 

P[A C , (v,r) G S | / = 1] = p [ A °i v = v,r = f | I = 1] 

(v,r)eS 

= P[I = 1] _1 P[A c ,v = v,r = f,I = 1] 

< P[/=l]- 1 Y P[A°\v = v,r = f] 

(v,f)£S 



P[/=l] 

(v,r)€S 



_ 1 ^ P[A C ,v = v,r = r]P[v = v,r 
' P[f = u, r = f] 



r 



= P[J = l]" 1 P[A C | v = v, r = f]P[v 

(v,r)eS 

< P[I = l]- l P[A c \ v = v,r = r]P[(v, r) G S] 

where (v,r) is the ordered pair maximizing PfA \ v = v,r = f] over all (v,f) G S. (The 
maximum exists because S is finite.) We now use the properties of the distribution of (G, I) 
to estimate each of P[J = l] -1 , P[A C | v = v,r = r], and P[(u, r) G £]. We have already 
noted that P[I = 1] = so we have P[I = = 0(1). Since (v,r) G S we have 

?) — > oo, f — > oo, and f = o(v). We know that, conditioned on v = v and r = f, the degree 
sequence of G is distributed as Mult i(v, 2v + r)|> 2 . Lemma [131 tells us that event A occurs 
a.a.s. in this model so we have PfA | v — v, r — r] — o(l). Finally, we may crudely estimate 
P[(v,r) G S] — 0(1). Combining these estimates we get P[A C \ I = 1] = o(l) as required. 



8 Circumference of ^(n, M) 

Lemma 17 Let M = M{n) = n/2 + s for some s = s(n) satisfying s = o{n) andn 2 ^ 3 = o(s). 
Fix k and suppose that the positive constant c* is k-admissible. The circumference ofQ(n, M) 
is a.a.s. at most (8c* + o(l))s 2 /n. 

Proof. Every cycle in a graph lies in the graph's core. By Lemma [15] the prekernel G 
of Q(n,M) is formed from the core of Q(n,M) by removing o(s 2 /n) vertices of degree 2. 
So, to prove the lemma, it suffices to show that the circumference of G is a.a.s. at most 
(8c* + o(l))s 2 /n. 

By Lemma [TBI there exists uo = u(n) — > oo such that the degree sequence d(G) of G a.a.s. 
lies in the set D of prekernel degree sequences d satisfying 



Wf)<4r(d), 

:d,>3 ^ ' 
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2. (1 - uj- 1 )8s 2 /n < v(d) < (1 + u' 1 )8s 2 /n, 

3. (1 - c^- 1 )32s 3 /(3n 2 ) < r(d) < (1 + c^- 1 )32s 3 /(3n 2 ), 

4. (1 - cj- 1 )32s 3 /(3n 2 ) < D 3 (d) < (1 + cj- 1 )32s 3 /(3n 2 ), and 

5. P[d(G) = d] > 0. 

Define A to be the event that the circumference of G is at most c*v(d(G)). We have 

P[A] >J2 P l A \ d ( G ) = d ] p [ d ( G ) = d]. 

deD 

Suppose that P[A | d(G) = d] is minimized over d G Z) by d = d*. (The minimum exists 
since D is finite.) Then 

P[A] > P[A | d(G) = d*] P[d(G) = d] 

deD 

= P[A I d(G) = d*](l +o(l)) 

since a.a.s. d G D. 

In general, the number of graphs on n vertices and M edges that have a given graph as 
their prekernel depends only on the number of vertices and edges of the given prekernel. So, 
conditioning on the event d(G) = d*, G is equally likely to be each prekernel with degree 
sequence d*. The probability P[A \ d(G) = d*] is thus the probability that a graph, chosen 
uniformly at random from all prekernels of degree sequence d*, has circumference a.a.s. at 
most c*v. Since d* G D, we may apply Corollary [T2] to conclude that this probability is 
1 + o(l). Thus P[A] = 1 + o(l). In other words, the circumference of G is a.a.s. at most 
c*v(d(G)). But we have seen v(d(G)) ~ 8s 2 /n a.a.s. so the circumference is a.a.s. at most 
(8c* + o(l))s 2 /n, as required. I 

Proof of Theorem [H Proposition [10] tells us that c* = 0.8697 is fc-admissible for 
k = 9. By Lemma fT71 the circumference of Q(n,M) is a.a.s. at most (8c* + o(l))s 2 /-n, < 
(6.958 + o(l))s 2 /n. I 
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